232
15
The Molecules of Life
odic table of the elements in chemistry. Indeed, the dehydron concept is needed to
computationally fold a peptide chain ab initio.
Examination of protein–protein interaction interfaces fully bears out the dehy-
dron interpretation. Appropriate complementarity is achieved by overexposed apolar
groups and dehydrons (rather than H-bond acceptors and donors, or positively and
negatively ionized residues, although these may play a minor rôle). One also notes
that each subunit of haemoglobin, a very stable and soluble (i.e., nonsticky) protein,
has just three dehydrons: Two are at the interface with the other subunits, and one is
the bond connecting residues 5 and 8 (i.e., flanking the sickle cell anaemia mutation
site at residue 6). In contrast, the prion protein, which is pathologically sticky, has
an extraordinarily high density of dehydrons (mean rhoρ is only about 11).
There are also evolutionary implications. It has long been realized that the evolu-
tion of proteins via mutations in their corresponding genes is highly constrained by
the need to maintain the web of functional interactions. There is a general tendency
for proteins in more evolved species to be able to participate in more interactions;
they have more dehydrons. For example, mollusk myoglobin is a perfectly wrapped
protein and functions as a loner. Whale myoglobin is in an intermediate position, and
human myoglobin is poorly wrapped, hence sticky, and operates together with other
proteins as a team. Although the folds in a protein of a given function are conserved
as species diverge, wrapping is not (even though the sequence homology might still
be as much as 30%). Structural integrity becomes progressively more reliant on the
interactive context as a species becomes more advanced.
A corollary is that the proteins of more complex species are also more vulnerable
to move into pathological states. The prion diseases form a good example; they are
unknown in microbes and lower animals. Moreover, they mainly attack the brain,
the most sophisticated and complex organ in the living world.
15.5.3
Protein Structure Determination
High-throughput methodology (also called structural genomics) comprises the
following steps:
1. Select the gene for the protein of interest.
2. Make the corresponding cDNA.
3. Insert the cDNA into an expression system.
4. Grow large volumes of the protein in culture (if necessary with appropriate iso-
topic labelling of C and N).14
5. Purify the protein (using affinity chromatography).
14 For some of the problems associated with the production of recombinant proteins, see Protein
production and purification, Nature Methods 5 (2008) 135–146.